WordSeer: Exploring Language Use in Literary Text

Authors

  • Aditi Muralidharan
  • Marti Hearst
Abstract

Increasing numbers of primary and secondary source texts in the humanities have been digitized in recent years. Because of their large scale, humanities scholars who want to study these new collections in depth need computational assistance. We have built WordSeer, a text analysis tool that includes visualizations and works on the grammatical structure of text, extracted using highly accurate off-the-shelf natural language processing tools. We have focused on the task of exploring language use patterns in a collection of North American slave narratives, but the technique is applicable to any text collection. Our preliminary user studies with humanities scholars show that, compared to a standard keyword-based search interface, WordSeer makes it easier for them to translate their questions into queries and find answers. In this paper, we present the system currently under development and describe text analysis features we plan to include in the next iteration.


Similar resources

Supporting exploratory text analysis in literature study

We present WordSeer, an exploratory analysis environment for literary text. Literature study is a cycle of reading, interpretation, exploration, and understanding. While there is now abundant technological support for reading and interpreting literary text in new ways through text-processing algorithms, the other parts of the cycle—exploration and understanding—have been relatively neglected. W...


Generic Analysis of Literary Translation: A Case Study of Contemporary English Short Stories

Translation of a literary text is a difficult task, for understanding literature requires knowledge of the various linguistic levels of a literary text in addition to strategies and methods of translation. To this should be added cognitive-based translation training, which helps practitioners preserve the aesthetic aspects of a literary text. Focusing on short story as a genre with both ...


Editorial Volume 5, Issue 1

Applied Literature, however, does not have literature at its centre. Literature in this domain is a tool to solve problems and achieve goals. Using literature to teach and learn languages, the application of literature to language education, is a very handy example. Health Humanities (by Crawford, et al. and reviewed by A. Ramazani in our Journal's previous issue) comprises chapters on how lite...


Foregrounding in the Fadak Sermon of Hazrat Zahra (AS)

Foregrounding is a contemporary literary theory that approaches texts, in prose or verse, from a literary perspective and endeavors to explain and analyze the features and elements of the discourse that rhetorically distinguish literary texts from ordinary ones. According to the Formalists, foregrounding is achieved through diminishing or increasing the rules. In other...


INFO256 Project Report Implementation and Evaluation of Xtract in WordSeer

Natural languages are full of word collocations that frequently co-occur and correspond to arbitrary word usages. They appear in both technical and non-technical textual corpora and often have specific significance in individual contexts. Accurately retrieving and identifying collocations from a given corpus in an unsupervised manner is imperative to understanding and automatically generating t...



Publication date: 2011